STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: ,' S LF.^P 

Date Processed by STIC: / 1 I^/O'J 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 

User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service r or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 

Revised 01/10/06 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

1 \J Wrapped Nucleics 
Wrapped Aminos 



', /o/SS3. VS? 



SUGGESTED CORRECTION SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 M isaligned Amino 
Numbering 



Non-ASCII 



10 



11 



12 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. _ 



Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

Patentln 2.0 A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

"bug" sequences(s) Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Skipped Sequences Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence: 



^Skipped Sequences 
(NEW RULES) 



_Use of n's or Xaa's 
(NEW RULES) 



(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence, (see item 1 1 below) 

_Use of <220> Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. Use 

of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or "Unknown." 
Please explain source of genetic material in <220> to <223> section or use "chemically synthesized" as 
explanation. (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32), also Sec. 1.823 of 
Sequence Rules 

_PatentIn 2.0 Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 

"bug" resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 

listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



Jnvalid <213> 
Response 



13 



Misuse of n/Xaa "n" can only represent a single nucleotide: "Xaa" can only represent a single amino acid 
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Jo fhc^ 





IFWP 




w- 
w- 



-> 
-> 



1/WO 2 004/0 9 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/553,459 

Input Set : A:\PTO.AMC.txt 
Output Set: N:\CRF4\01232007\J553459. 



DATE: 
TIME: 



01/23/2007 
16:44:18 



raw 



Paul 



W--> 
THEREOF 

12 

C--> 
C--> 



PCT/US2004/01162 

4 <110> APPLICANT: Hallenbeck, 

5 Hampton, Garret 

6 Hay, Carl 

7 Huang , Ying 

8 Jakubczak, John 

9 Phipps, Sandrina 
10 <120> TITLE OF INVENTION: FLAP ENDONUCLEASE 



Does Not Comply 

Corrected Diskette Meeded 

/ (fr\ ^i^^ 



1 (FENl) REGULATORY SEQUENCES AND USES 



W--> 



<130> FILE REFERENCE: GTIN-021WO 

14 <140> CURRENT APPLICATION NUMBER: US/10/553,459 

14 <141> CURRENT FILING DATE: 2005-10-17 

14 <150> PRIOR APPLICATION NUMBER: 60/463,148 

15 <151> PRIOR FILING DATE: 2003-04-15 

16 <160> NUMBER OF SEQ ID: 8 

17 <170> SOFTWARE: FastSEQ for Windows Version 4.0 




sapiens 



ERRORED SEQUENCES 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 2259 

21 <212> TYPE: DNA 

22 <213> ORGANISM: H. 
W--> 23 <400> SEQUENCE: 1 

24 catgcggtta tcaaggagcc tggtgctgcc 

25 tgtagctgca gatcaatggc ccttattagc 

26 gaaaacgtga ctaaaggtac gaactctctg 

2 7 tgcctacaat ttccggagct ttgtggacca 

28 gctataagtc actgactggt ggtacccaga 

29 cattttttgg acggcgtctc actctgtcgc 

30 ctcactgcaa cctctgcctc ccgggttcta 

31 ctgggactat aggattacag gtgcgcacca 

32 atggggtttt gccatgctgg ccaggatggt 

33 gcctcggcct cccaaagtac* tgagattaca 

34 tttttttttt ccagacagtc ttgctctatc 

3 5 tgccaactcg gctcactgca agctccgcct 
36 ctcccgagca gctgggatta caggtgtgaa 

E--> 37 taggagagac agggtttcac cttgctggcc 

38 ccgctcgcct cagcctccca aagtgctggg 

39 tcaatttcta attcagtatt ttcctcacta 
E--> 40 ggtcaagaca ttcaagttct ggttctgagt 

41 tctatttcac agaactgaaa aataagaaag 




gtgaaacaga 

attttctgag 

cctgagaaaa 

gtgtctatag 

tctcaatatc 

ccgggctgga 

gagattctca 

ccacatctaa 

cttgaattcc 

ggtgtgagtt 

gcccaggctg 

tctgggttca 

ccaccatg cc 

aggctggt0t~ 

attacaggag 

cctatgctat 

aatctgagjf): 

atgatgaatc 



ggctgatttt 
gccaataatc 
accacataca 
acaccaagct 
tttttttttt 
gggcagtggc 
tacctcagcc 
tttttgtatt 
tgacctcagg 
gccgcgccca 
gagtgcctgg 
agtgattatc 
cggctaattt 



agcccggaaa 


60 


tgaccactat 


120 


agaaaaagtt 


180 


gagaaccccc 


240 


gacggagtct 


300 


acgatctcgg 


360 


tctcgagtag 


420 


tttagtagag 


480 


tgatctgcct 


540 


ggctcaattt 


600 


agtgcagtgg 


660 


ctgcctcagc 


720 


tttgtatttt 


780 



tgaacttctg 

tgaaccaccg 
tflt.ggaaf.nt-, 



acctcctgat 

cgcctggccc 
tgtgagrtat 



840 

900 
960 



tgagtaaagc 

aaagcatcta 



gactgtaata 

gtgcctagca 



1020 

1080 



file://C:\CRF4\OUTHOLD\VsrJ553459.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/553,459 



DATE: 01/23/2007 
TIME: 16:44:18 



42 

E--> 43 
44 
45 
46 
47 
48 
49 
50 
51 

E--> 52 

53 
54 

E--> 55 

56 
57 
58 

E--> 5ft 
E--> 
E--> $5 
E--> 
E--> 69 
E--> 70 
E--> 71 
E--> 72 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\01232007\J553459.raw 



gggagtattt 
ataatgtctt 

tgtcaccggt 
gcccccattg 
caaaatgata 
gcactgagaa 
gtcactagga 
agaggctgtg 
ccaataaaca 
aaa taaagat 
gc< 
gggc 



jgctt 



tgctcaacag 
ttatgaaaga 

cagccaatgc 
ttctgagaca 
aagacaatca 
tacaaagatg 
ggcgtggccg 
ggcgctccta 
tttgcccagg 
aacacaccac 



qtat ttqctt ccttcctaaa qctataggga a g^gat»g*g n An 



'If 
gggc|/:gtaa 

ttctaaaatc 

ctggaattca 

ctgccattta 

aatagacatc 

aaaacaaaga 

ggaaagtctg 

gcattgtagg 

cgataacaga 



cgtaaagatc 

cagaacataa 
attcagtaaa 
ttgagcttcc 
atcccagagc 
agtccatgga 
gccaagtgcc 
atgggcacgg 
gatatacaaa 



tgtacaaatg 

caactctaga 
caatcacggc 
aattacgggc 
tagatgcgcg 
acgtggccag 
tgagagttgg 
gttcggcaga 
ctgqaaqqta 



ttaacttcat 

gaagtaaact 
ccccttcccc 
cctctgtttg 
tcagacggtg 
agatctgtac 
aagtgcttca 
agaactttcc 
ttcaaaattc 



1200 

1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



:acgcc 

:tcccc 
tggatctccg 
atctccgtct 
cgctccgcca 
agtgcctccg 
gascasgggc 



tctcgccctt 

cttccccacc 
gcgtctgagg 
ggaactcccc 
ccggaagaac 
gcgcaagtgg 
ctgcggggac 



agaaatcgcg 

cttcagccca 
agataagcgc 
tcaacgctct 
acgtcgacag 
cattgaggga 
ctggtgtggg 




agctgagaaa 

agccggaggt 
ggtgtgggtc 
caccattttg 
gagcaggcgc 
cttgtagtcc 
tggagtgggg 



cctaaggagt 

tccaggagcg 
agaccccgag 
ccccgcgaag 
ctagcacaac 
tgcgatttcg 
acaagcggtg 



tcatggcaag 

tctagccctc 
gggtcctcgc 



1740 

1800 
1860 



gfejtaatccgc 

cggaaaagga 
ggtgtagagg 
gagaagggta 



1920 

1980 
2040 
2100 



cgccagggtc gctgagagac tctgttctcc ctggagggac tggttgccat gagagcagcc 2160 
gtctgagggg acgcagcctg cactacgcgc cccaagaggc tgtgcgtggc gagcaggtca 2220 
cgtgacggga gcgcgggctt tggaaggcgg ctgaacgtc ^ 2-ZS? 



2259- 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 01/23/2007 

PATENT APPLICATION: US/10/553 , 459 TIME: 16:44:19 

Input Set : A:\PTO.AMC.txt 
Output Set: N:\CRF4\01232007\J553459.raw 

Invalid Line Length; 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:l; Line(s) 71 
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<210> 8 
<211> 270 




<212> DNA 



VERIFICATION SUMMARY DATE: 01/23/2007 

PATENT APPLICATION: US/10/553,459 TIME: 16:44:19 

Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\01232007\J553459.raw 

L:l M:259 W: Allowed number of lines exceeded, (1) GENERAL INFORMATION: 

L:2 M:259 W: Allowed number of lines exceeded, (1) GENERAL INFORMATION: 

L:10 M:283 W: Missing Blank Line separator, <120> field identifier 

L:14 M:270 C: Current Application Number differs, Replaced Current Application No 

L:14 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:16 M:283 W: Missing Blank Line separator, <160> field identifier 

L:23 M:283 W: Missing Blank Line separator, <400> field identifier 

L:37 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:1 

L:40 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 1 

L:43 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 1 

L:52 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:1 

L:55 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:1 

L:59 M:254 E: No. of Bases conflict, this line has no nucleotides. 

M:254 Repeated in SeqNo=l 

(1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS: 6 
(48) String data converted to lower case, 

(1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS : 7 
M:112 Repeated in SeqNo=l 

L:72 M:252 E: No. of Seq. differs, <211> LENGTH : Input : 2259 Found:2269 SEQ:1 

L:78 M:283 W: Missing Blank Line separator, <400> field identifier 

L:87 M:283 W: Missing Blank Line separator, <400> field identifier 

L:97 M:283 W: Missing Blank Line separator, <400> field identifier 

L:103 M:283 W: Missing Blank Line separator, <400> field identifier 

L:109 M:283 W: Missing Blank Line separator, <400> field identifier 

L:115 M:283 W: Missing Blank Line separator, <400> field identifier 

L:119 M:259 W: Allowed number of lines exceeded, <211> LENGTH: 

L:122 M:259 W: Allowed number of lines exceeded, <211> LENGTH: 

L:125 M:259 W: Allowed number of lines exceeded, <211> LENGTH: 

L:126 M:259 W: Allowed number of lines exceeded, <211> LENGTH: 

L:131 M:283 W: Missing Blank Line separator, <400> field identifier 



L:62 M:320 E 
L:62 M:112 C 
L:65 M:320 E 
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